FIG: 1



A) Sequence inserted in 3'-UTR of YFP plasmid

Name: E2-1-(CDC34)
Length: 607

Insert Sequence For siRNA Design:
CAAGGGGCTGCAGGAAGAGCCGGTCGAGGGATTCCGCGTGACACTG
GTGGACGAGGGCGATCTATACAACTGGGAGGTGGCCATCTTCGGGC
CCCCCAACACCTACTACGAGGGCGGCTACTTCAAGGCGCGCCTCAA
GTTCCCCATCGACTACCCATACTCTCCACCAGCCTTTCGGTTCCTGA
CCAAGATGTGGCACCCTAACATCTACGAGACGGGGGACGTGTGTATC
TCCATCCTCCACCCGCCGGTGGACGACCCCCAGAGCGGGGAGCTGC
CCTCAGAGAGGTGGAACCCCACGCAGAACGTCAGGACCATTCTCCT
GAGTGTGATCTCCCTCCTGAACGAGCCCAACACCTTCTCGCCCGCAA
ACGTGGACGCCTCCGTGATGTACAGGAAGTGGAAAGAGAGCAAGGG
GAAGGATCGGGAGTACACAGACATCATCCGGAAGCAGGTCCTGGGG
ACCAAGGTGGACGCGGAGCGTGACGGCGTGAAGGTGCCCACCACG
CTGGCCGAGTACTGCGTGAAGACCAAGGCGCCGGCGCCCGACGAG
GGCTCAGACCTCTTCTACGACGACTACTACGGGACGGCGAGGTGGA
GGAGGAGG



B) Randomly selected siRNA targeting the 3'-UTR insert

Name         ID     Antisense Strand       Sense Strand
CDC34_18     16880  CCGUCCCGUAGUAGUCGUCgt  GACGACUACUACGGGACGGtt
CDC34_19     16881  CGUCCCGUAGUAGUCGUCGta  CGACGACUACUACGGGACGtt
CDC34_22     16882  CCCGUAGUAGUCGUCGUAGaa  CUACGACGACUACUACGGGtt
CDC34_28     16883  GUAGUCGUCGUAGAAGAGGtc  CCUCUUCUACGACGACUACtt
CDC34_29     16884  UAGUCGUCGUAGAAGAGGUct  ACCUCUUCUACGACGACUAtt
CDC34_34     16885  GUCGUAGAAGAGGUCUGAGcc  CUCAGACCUCUUCUACGACtt
CDC34_41     16886  AAGAGGUCUGAGCCCUCGUcg  ACGAGGGCUCAGACCUCUUtt
CDC34_42     16887  AGAGGUCUGAGCCCUCGUCgg  GACGAGGGCUCAGACCUCUtt
CDC34_46     16888  GUCUGAGCCCUCGUCGGGCgc  GCCCGACGAGGGCUCAGACtt
CDC34_58     16889  GUCGGGCGCCGGCGCCUUGgt  CAAGGCGCCGGCGCCCGACtt
CDC34_61     16890  GGGCGCCGGCGCCUUGGUCtt  GACCAAGGCGCCGGCGCCCtt
CDC34_63     16891  GCGCCGGCGCCUUGGUCUUca  AAGACCAAGGCGCCGGCGCtt
CDC34_72     16892  CCUUGGUCUUCACGCAGUAct  UACUGCGUGAAGACCAAGGtt
CDC34_75     16893  UGGUCUUCACGCAGUACUCgg  GAGUACUGCGUGAAGACCAtt
CDC34_77     16894  GUCUUCACGCAGUACUCGGcc  CCGAGUACUGCGUGAAGACtt
CDC34_80     16895  UUCACGCAGUACUCGGCCAgc  UGGCCGAGUACUGCGUGAAtt
CDC34_84     16896  CGCAGUACUCGGCCAGCGUgg  ACGCUGGCCGAGUACUGCGtt
CDC34_85     16897  GCAGUACUCGGCCAGCGUGgt  CACGCUGGCCGAGUACUGCtt
CDC34_90     16898  ACUCGGCCAGCGUGGUGGGca  CCCACCACGCUGGCCGAGUtt
CDC34_99     16899  [illegible]            [illegible]
[illegible]  16900  [illegible]            [illegible]
[illegible]  16901  [illegible]            [illegible]
CDC34_138    16902  [illegible]            [illegible]
CDC34_143    16903  [illegible]            [illegible]
CDC34_153    16904  [illegible]            [illegible]
CDC34_156    16905  [illegible]            [illegible]
CDC34_163    16906  [illegible]            [illegible]
[illegible]  16907  [illegible]            [illegible]
CDC34_173    16908  [illegible]            [illegible]
CDC34_180    16909  [illegible]            [illegible]
CDC34_181    16910  [illegible]            [illegible]
[illegible]  16911  [illegible]            [illegible]
[illegible]  16912  [illegible]            [illegible]
[illegible]  16913  [illegible]            [illegible]
[illegible]  16914  [illegible]            [illegible]
[illegible]  16915  [illegible]            [illegible]
[illegible]  16916  [illegible]            [illegible]
[illegible]  16917  [illegible]            [illegible]
[illegible]  16918  [illegible]            [illegible]
[illegible]  16919  [illegible]            [illegible]
[illegible]  16920  [illegible]            [illegible]
[illegible]  16921  [illegible]            [illegible]
[illegible]  16922  [illegible]            [illegible]
[illegible]  16923  [illegible]            [illegible]
[illegible]  16924  [illegible]            [illegible]
[illegible]  16925  [illegible]            [illegible]
[illegible]  16926  [illegible]            [illegible]
[illegible]  16927  [illegible]            [illegible]
CDC34_303    16928  [illegible]            [illegible]
CDC34_316    16929  [illegible]            [illegible]
CDC34_318    16930  ACCUCUCUGAGGGCAGCUCcc  GAGCUGCCCUCAGAGAGGUtt
CDC34_325    16931  UGAGGGCAGCUCCCCGCUCtg  GAGCGGGGAGCUGCCCUCAtt
CDC34_326    16932  GAGGGCAGCUCCCCGCUCUgg  AGAGCGGGGAGCUGCCCUCtt
CDC34_327    16933  AGGGCAGCUCCCCGCUCUGgg  CAGAGCGGGGAGCUGCCCUtt
CDC34_328    16934  GGGCAGCUCCCCGCUCUGGgg  CCAGAGCGGGGAGCUGCCCtt
CDC34_348    16935  GGUCGUCCACCGGCGGGUGga  CACCCGCCGGUGGACGACCtt
CDC34_349    16936  GUCGUCCACCGGCGGGUGGag  CCACCCGCCGGUGGACGACtt
CDC34_396    16937  CGUAGAUGUUAGGGUGCCAca  UGGCACCCUAACAUCUACGtt
CDC34_402    16938  UGUUAGGGUGCCACAUCUUaa  AAGAUGUGGCACCCUAACAtt
CDC34_441    16939  GUGGAGAGUAUGGGUAGUCga  GACUACCCAUACUCUCCACtt
CDC34_448    16940  GUAUGGGUAGUCGAUGGGGaa  CCCCAUCGACUACCCAUACtt
CDC34_454    16941  GUAGUCGAUGGGGAACUUGaa  CAAGUUCCCCAUCGACUACtt
CDC34_460    16942  GAUGGGGAACUUGAGGCGCac  GCGCCUCAAGUUCCCCAUCtt
CDC34_461    16943  AUGGGGAACUUGAGGCGCGcc  CGCGCCUCAAGUUCCCCAUtt
CDC34_468    16944  ACUUGAGGCGCGCCUUGAAat  UUCAAGGCGCGCCUCAAGUtt
CDC34_478    16945  CGCCUUGAAGUAGCCGCCCtc  GGGCGGCUACUUCAAGGCGtt
CDC34_482    16946  UUGAAGUAGCCGCCCUCGUaa  ACGAGGGCGGCUACUUCAAtt
CDC34_484    16947  GAAGUAGCCGCCCUCGUAGta  CUACGAGGGCGGCUACUUCtt
CDC34_485    16948  AAGUAGCCGCCCUCGUAGUag  ACUACGAGGGCGGCUACUUtt
CDC34_531    16949  CCUCCCAGUUGUAUAGAUCgc  GAUCUAUACAACUGGGAGGtt
CDC34_533    16950  UCCCAGUUGUAUAGAUCGCcc  GCGAUCUAUACAACUGGGAtt
CDC34_538    16951  GUUGUAUAGAUCGCCCUCGtc  CGAGGGCGAUCUAUACAACtt
CDC34_540    16952  UGUAUAGAUCGCCCUCGUCca  GACGAGGGCGAUCUAUACAtt
CDC34_542    16953  UAUAGAUCGCCCUCGUCCAcc  UGGACGAGGGCGAUCUAUAtt
CDC34_548    16954  UCGCCCUCGUCCACCAGUGtc  CACUGGUGGACGAGGGCGAtt
CDC34_555    16955  CGUCCACCAGUGUCACGCGga  CGCGUGACACUGGUGGACGtt
CDC34_569    16956  ACGCGGAAUCCCUCGACCGgc  CGGUCGAGGGAUUCCGCGUtt
CDC34_578    16957  CCCUCGACCGGCUCUUCCUgc  AGGAAGAGCCGGUCGAGGGtt
CDC34_586    16958  CGGCUCUUCCUGCAGCCCCtt  GGGGCUGCAGGAAGAGCCGtt



Normalised potencies of 79 siRNAs targeting the 3'-UTR insert of the YFP fusion
mRNA (50 nM, Oligofectamine, 50 h, H1299)
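The legible rows of panel B appear to follow a simple pattern relative to the insert in panel A: the sense strand is a 19-nt stretch of the insert written as RNA plus a "tt" overhang, and the antisense strand is its reverse complement plus a two-base lowercase overhang complementary to the two insert bases just upstream. The number in each name appears to be the distance of the sense core's last base from the 3' end of the insert. The sketch below illustrates that inferred pattern only. The function names are hypothetical, the pattern is an assumption read off the table rather than a stated design rule, and some rows (for example CDC34_402) show a different antisense overhang.

```python
# Sketch of the duplex pattern inferred from the legible rows of panel B.
# All names here are illustrative; the rule itself is an assumption.

DNA_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

# Last 54 nt of the panel A insert (its final two printed lines).
INSERT_TAIL = "GGCTCAGACCTCTTCTACGACGACTACTACGGGACGGCGAGGTGGA" "GGAGGAGG"


def reverse_complement(dna):
    """Reverse complement of an upper-case DNA string."""
    return "".join(DNA_COMPLEMENT[base] for base in reversed(dna))


def to_rna(dna):
    """Write a DNA string in RNA letters (T -> U)."""
    return dna.replace("T", "U")


def design_duplex(insert, position, length=19):
    """Return (antisense, sense) for the site named by `position`.

    `position` is taken to be the 1-based distance of the sense core's
    last base from the 3' end of `insert` (assumed naming convention).
    """
    insert = insert.upper()
    end = len(insert) - position + 1   # exclusive slice end of the core
    start = end - length
    if position < 1 or start < 2:
        raise ValueError("site needs a full core plus two upstream bases")
    core = insert[start:end]
    upstream = insert[start - 2:start]
    antisense = to_rna(reverse_complement(core)) + reverse_complement(upstream).lower()
    sense = to_rna(core) + "tt"
    return antisense, sense


antisense_18, sense_18 = design_duplex(INSERT_TAIL, 18)
print("CDC34_18", antisense_18, sense_18)
```

Because positions are counted from the 3' end, the tail of the insert is enough to reproduce the first rows of the table; sites further upstream need the full 607-nt sequence.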
FIG: 2

35 target plasmids: 2717 siRNA sequences

Training set: 2442 siRNA sequences
Testing set: 275 siRNA sequences

Exclusion of sequences with more than 30% variation in replicate

Filtered training set: 2109 siRNA sequences
Filtered testing set: 234 siRNA sequences
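The flowchart removes 333 sequences from the training set (2442 to 2109) and 41 from the testing set (275 to 234). The figure does not say how "variation" between replicates is computed, so the sketch below assumes one possible definition, the spread of the replicate values divided by their mean. The function names and the example measurements are hypothetical and serve only to illustrate the filtering step.

```python
# Minimal sketch of a replicate-variation filter.
# ASSUMPTION: variation = (max - min) / |mean| of the replicate values;
# the figure does not define the measure it uses.

def replicate_variation(values):
    """Relative spread of a list of replicate measurements."""
    mean = sum(values) / len(values)
    if mean == 0:
        return float("inf")  # undefined relative spread, treat as too variable
    return (max(values) - min(values)) / abs(mean)


def filter_by_variation(measurements, max_variation=0.30):
    """Keep only entries whose replicates vary by at most `max_variation`."""
    return {
        name: reps
        for name, reps in measurements.items()
        if replicate_variation(reps) <= max_variation
    }


# Hypothetical replicate potencies, not data from the figure.
example = {
    "siRNA_a": [0.80, 0.90],  # spread 0.10 / mean 0.85, kept
    "siRNA_b": [0.20, 0.60],  # spread 0.40 / mean 0.40, excluded
    "siRNA_c": [0.50, 0.50],  # identical replicates, kept
}
kept = filter_by_variation(example)
print(sorted(kept))
```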
FIG: 3

FIG: 4



Inactive 0.75 active 



9 False positive (FP) = predicted active but inactive
22 True positive (TP) = predicted active and active
141 True negative (TN) = predicted inactive and inactive
62 False negative (FN) = predicted inactive and active
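The four counts sum to 234, the size of the filtered testing set in FIG. 2. As a worked check, the sketch below derives the usual summary rates from them. The function name and the choice of metrics are illustrative and do not appear in the figure.

```python
# Summary rates from the FIG. 4 confusion-matrix counts.
# The metric selection is illustrative; the figure reports only the counts.

def confusion_metrics(tp, fp, tn, fn):
    """Return common classification rates for one confusion matrix."""
    total = tp + fp + tn + fn
    return {
        "total": total,
        "sensitivity": tp / (tp + fn),  # share of active siRNAs predicted active
        "specificity": tn / (tn + fp),  # share of inactive siRNAs predicted inactive
        "precision": tp / (tp + fp),    # share of predicted-active siRNAs that are active
        "accuracy": (tp + tn) / total,
    }


metrics = confusion_metrics(tp=22, fp=9, tn=141, fn=62)
for name, value in metrics.items():
    print(name, value if name == "total" else round(value, 3))
```

With these counts the sensitivity is about 0.26, the specificity 0.94, the precision about 0.71 and the accuracy about 0.70.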
FIG: 5




